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ABSTRACT 

Structural equation models ir.corpor ating unmeasured 
variables make possible ^he rigorous testir.g of theories previously 
difficult to test adequately because of fallible measures of the 
theoretic variables. This paper first discusses a simple causal 
model; incorporating a single unmeasured variable fcr the purpose of 
exposition. A substantive example fcllcws, ir^corporating several 
unneasured variables for which multiple indicators *ere available. 
Joieskog's LISREL model for the analysis of linear structural 
relationships was used. The problem studied concerned the processes 
of educational attainment among several ethnic groups. The most 
important su bstantive conclusion was that measurement errors differ 
between ethnic groups and that ignorirg errcrs leads to biased 
estimates of structural effects. Howeverr applicaticn of LISREL 
depends on the collection of appropriate data. Unless the data have 
been collected by appropriate procedures^ a rd unless the model is 
adequately specified^ LTSEEl is unlikely to produce definitive 
results. (Author/CTM) 



ifiifiifiiiiiiiitiitiitii^itiitii^iiiiiiitiitiitiitiitiiiiitii^^ 

* Reproductions supplied by EDRS are the best that can be made * 

* from the origiral document. * 

ERLC 



" * frn'^r/.V^^"^ HEALTH. 
eouCATION A WELFARE 

_ NATIONAL INSTITUTE OF 

0> EDUCATION 

EDUCATrON POSrrrON OR POLICY 



f 

UNMEASURED VARIABLES IN PATH ANALYSIS^^ 



"PERMISSION TO REPRODUCE THIS 
MATERIAL HAS BEEN GRANTED BY 



TO THE EDUCATIONAL RESOURCES 
INFORMATION CENTER (ERIC) AND 
USERS OF THE ERIC SYSTEM." 



Lee M. Wolfle 



Virginia Polytechnic Institute and State University 



00 
CO 

00 




* This paper was 
meetings of the Ameri 
San Francisco, April 
in part supported by 
Statistics, Departme 
(No. 300-78-0561). 
tions using facilit 
and State Universit 
Moment provided hel 
are gratefully ackr ow 



repared for presentation at the annual 
can Educational Research Association, 
3-12, 1979. Work on this project was 
^he National Center for Education 
z of Health, Education and Welfare 
_:san Rothschild assisted in computa- 
of Virginia Polytechnic Institute 
lorthern Virginia Campus. Maureen 
ui suggestions and criticisms which 
) ^*dged. 




UNMEASURED VARIAELES PATH ANALYSIS* 



Lee M. Wo 1 fie 

Sr.ructmral equation models r.ave been useful im estimating parameters 
of rnriv. subsrarrT-rive problems in C-iucnclonal research. Such models have 
been a-'^lli.d : re stpdy che effect f educational a .aiinnent in Inter- 

7 ■ 

genera: -ona< Tr rupatiCTial mobilicy (Blau and Dune • 1967), the social 
psycho ^^r:ic£il effects of one's be^t friend' coll e plans on the 
responarir - :x mrther jucation CT>.ii:ri:-n, Hal^-r, : ortes, 1968), the 

effect g: * -renc?' an teachers' -^ur^gemenr uv , ^ucational attain- 
ment .Sev :zz ^*ause>-. 1975), stt. ?thnic -n: li psychological 
effec-ns : £ir_^uu::nic ai::iv„svemeu^T ^.Anrderson -and E -ns^ 1974; Anderson, 1978). 

\ tr i cite* r-'ilyses 'iniiked, raosr szi,. ^7:== incorporating 

regr^-v.^'-^ f -ructrrra --uatic: -^ocedrres ve r^ested upon the impliciit, 
but imro -..^.s^mp- th<i e inde ^naent r _ables were measured 

without >r e sti .,.alo^.., 196 -9). J- pr .r..c/, -measurement errors 

in ind wV- ^ -/ariables *^iave r ^en ignored, because it, was felt that ignc/.^a; 
random .m£a?>iwre~ent error:- inerel^^ a to conclusions inore conservative than 
woulii - ' • .ii be tat case. 'or example, it is well known that least-scjisres 
estimat-'H) edures yield at-enuaced estimates of ;he regression slope amd 

correla; i^dii iroo f f icient in the ^r^'-T^iate case (see Appendix). Thus, it has 
been belra^ tnat such result^ uucjerstaiire the true relationships. In the 



* Tiiif piioer was prepcrefl^ nrr presentation at the annual meetings of 
the American E<f ucational Researrzii Association, San Francisco, April 8-12, 
1979. Work .-5n this project was; in part supported by the National Center 
for Educatic:! Statistics, DepairtnKnt of Health, Education and Welfare 
(No. 300-78^05' Susan Roth-'Ju-id assisted in computations using 

facilities i::r irginia Poly tecttani-:: Institute and State University, 
Northern V^rtri-^ia Campus. Majrrefi^ Moment provided helpful suggestions 
and crltic:^^^ ^hich are graiLt .^y acknowledged. 



case of multiple .nd^pendent van^u^les, ho^sver, the existence of measurement 
er^-ors becomes a ^serious probi^ Hlalock, 1965; Bohrnstedt, 1969). 

From £ fonnain polmi of viLe*^ assuming measurement error in structural 
equation moifels iis anuch the saiL.c assuming variables to be unobserved, 
ilir fornex ca&e^ one assumes cfrst the true variable is observable, but 

with error; in the latter, assumes :hat the true variable is 
-onobservable, am.' -j^es in its plar_a one or r -re inmerf ectly measured 
rmdiinaCDrs. Beciuise t ik variabii=:i may never be measured exactly, in a 
3rr ./ct 3enj«e ai:. tn- - variahl-^ ^.Te unobservec. In practice, then, 
tJoservarim;^ arti colHected on i^ailifest variabliL.s thought to be related 
:o the lat»^'rrc: "^ariable of real theoretical intsr^st. 

=^leawiremer ^rror« and unobsHtTred vari^'^^ue constitute a large tnpic. 
Indeec th^r fir*:!::: of iv-^ycnomc rrics addresses n seif almost entirely to the 
probJem rors in var\;.ible3. r: --^-ciology. .-^bs-tantial efforts — 

under wa^ rrj estim^.re error in d^.ta ollectior 5cr example, Schumax -znd 
Presser, 19^:^), ;xmd in e^ imation ^procedures (fcr exsanple, Blalock, .ie^lls 
and Carr-r, I ^r'- W.Iey, 1973). Tr. path analysis, medels containing 
unobserved vsriahles hav^ been a -nnrt of the literature for years (Hcr^ge 
and Treimar. l9f^B; and Hoc^, 1968; Duncan, 19^9b; Hauser, 1969; 

Land, 1970; Wiley nc Wi^c-y, 197rr; Hauser and Goldber^rer, 1971; Duncan, 
Featherman id^I inc^i:, i9^I2; Haunctr-, 1971; Otto and F-atherman, 1975.; 
Bielby, Hauser ^ ' -i Fea- V:erTnan, 1977). Indeed, one c the earliest 
substantive aptnl :k .^f pa-h analysis was by Sewell Wright (1925 » to 

the interacticin b^^itru^^L^n corn crops and hog prices, sr^d included hog 
breeding variables ^^dh were unobserved. 

The applicaticirr ^ structural models incorporating unmeasured 
variables may IsitU^ct :he explanation of educational phenomena. 
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As Kerlinger (1977) pointed out, models including unmeasured variables make 
possible the rigorous testing of theories previously difficult to test 
adequately because of fallible measures of the theoretical variables. 
And as Cooley (1978) noted, such models now define the "state of the art" 
in educational research. Unfortunately, both expository articles and 
reports of substantive applications of structural equation models incor- 
porating unmeasured variables have been rare in educational research 
literature. This paper first discusses a simple causal model, incorporating 
a single unmeasured variable for the purpose of exposition. A substantive 
exannple will follow, incorporating several unmeasured variables for which 
multiple indicators were available. This paper thus extends the work of 
Woifle (1977) and Williams (1978), who provided introductions to the 
subject of path analysis from the perspective of regression analysis, 
Woifle (1978), who exposited path analysis as a means of substantive 
interpretation of data, and Anderson ( 1978) , who exposited a nonrecuxsive 
equation mc^el. 

Let us begin with a simple example. Consider a simple causal chain 
of the process of intergenerational occupational mobility from father's 
socioeconomic status (X^) , to respondent's educational attainment (X^) , 
to respondent's socioeconomic status (X^). However, let us revise the 
model such that true educational attainment is not directly observed. 
Instead, its observed indicator, educational attainment, is contaminated 
with errors of measurement. We assume that the amount of education actually 
recorded is caused by the respondent's true educational attainment, in 
addition to several other factors. For example, the respondent may be more 
or less ignorant of the number of yenrs o^ regular school or college he or 
she completed and got credit for." The respondent may tend to round off 



educational attainment to even years, or multiples of four (such 

16, or 20). Some respondents may wish to appear to have acquire • (otr 

less) schooling tlian was actually the case. 

The complete model consists of three equations, r.he firs.: ro!i 
which descrribes the fallible measurement of observed education vJullo* tbe 
otfier two r:epresent the causal model as such.. The three equat ^u ^ may 
be written: 

^2 = ''2n'^ ^ 

n = b^3X3 + u, 

X = b, n + V, 
1 In 

where x^ is the respondent's Duncan (1961) socioeconomic inaex <\ore • as 
revised by Hauser and Featherman (1977), x^ is the respondent's ?Le<>^^^^rde<^ 
educational attainment, x^ is respondent's father's Duncan soc iotr. omor 
index score, n is true educational attainment, and e, u, and v res^cfaial 
errors. All of these are measured as deviations from their nk he i's 

are, therefore, regression coefficients, and b^^ = 1.00. The . ur.-^il 

relationships may be diagrammed, as sho\^ in Figure 1. 




Figure 1. A Causal Chain of TntergenerationaL Status Attaiip^ent 



A remarV 3ti,i^ property of this model is that unbiased estimates of 
* ^jit coeffiiiieniitiS anay be obtained for both the cau&al parameters and the 
MasKurementr -T-racess. This occurs because the oiodel is overidentified, 
aii«i hecEKse -r some, restrictive assumptions inpn^sd: on the ejected 
Rsmi&ciJiz v.m^ ")f chre residual errors with other varu-ahles in the model. 
Tha-^jasna^ speeclf ication in such moc^ls is that zht resir errors are 
uncorTre I g t">r . j?. tbe population with other, pred=tei~ziirr«^^^' .;i:r::ableo in the 
equal 1 l^.UL. 

i:i^.u, = E(nv) = E(x^v) - 0. 

In actUiti n. it is assumed that the residual error '-^^ '^asurrement is uncor- 
rtlsr : i ± - he true score, n, and also with both - ,-nd x:^: 



Thea^ ^^rronr assumptions are roughly equivalent to -ssuming the error of 
meais\are7nent random, and not systematic. Thes^ oversimplifying assump- 
ti3r:is are prooerties of the model, not necessarily of what the world is 
reA,' I7 like In any realistic context, these assmmptions are problematic, 
ar; aust be assessed against the researcher's kn wledge of the topic under 
imnKstigation. For example, it is possible that respondents whose fathers 
art: employed in occupations of low socioeconomic status, or who themselves 
ar*^ employed in such occupations, tend to overstate their educational 
attainment. Complex models can be constructed whirrh permit the inter- 
correlation of residuals, but the simple alternatives for this simple example 
^re either to abandon the exercise or to accept tlx*; ^restrictive assumptions. 

The three equations in the model may be reduc:^ to three normal 
e<sBLiations with three unknowns. The details of theat computations are 
shown in the Appendix. For purposes of illustration, the model was estimated 
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1 ^>b) = E(x^e) = E(x2e) = 0. 



with data takan frujc rhe 1977 general _:ocial survey of the jiacional Opinion 
RespFrch Center (^.977), for whites and blacks. The correl^rLoras, means anc 
stanjdard deviatiaT^^>5 are shown in Table i, and the results ir liiable 2. 

The upper pamebul of Table 2 shows rhe results one wou7 . obtain 
witl\ ::rdlnary leasx squares (OLS). ".lie structuraJ. coef flci. timts (regression 
coep^.cients in t:^eir original Tnetri:^) are shown, and beloic- =ach in paren- 
thesis are the stanrdarr-H-zed (path) coefficients. When connsaz±ng aocioeco- 
'^ocic rreturns acrn^ ^nzups, the structural coefficients sbancild be used 
ifie Kim and Mueller. 1976). The coefficients of determinatijDn are shown 
the right-hand co. imc For whites, one would conclude tnat one point of 
i bar's socioeconomic index (SEI) returns about .05 years of education, 
a**' that one year of -^:ducation was converted into about 4.5 points of respon- 
derc's own SEI. Fo lacks also, one would conclude that one point of 
fa her's SEI yields .bout .05 years of education, but that blacks were able 
tt- ronvert one addi r.al year of education into only 3.4 points of their 
own SEI. 

The lower panr-^-L of Table 2 shows the results tha t one would ob tain 
from the model diagramed in Figure 1. Note that measurement errors seem 
to be larger for blacks than for whites; that is, the standardized coeffi- 
cient relating true education to observed education is larger for whites 
than for blacks. Comparing the OLS estimates to the corrected estimates 
for the regression of education of father's SEI, one should note that the 
OLS estimate is identical to the corrected estimates. Random measurement 
error in the dependent variable does not bias the OLS estimate. However, 
random error of measurement in the independent variable imparts a down- 
ward bias to the OLS estimate. And the lower the precision of measurement. 



Table 1. Correlations, Means, and Standard Deviations of Sjisrus 
Variables; l>fhites Above Diagonal (N = 12II3\ Blacks Below 
,t Diago-:al (N = 172). 





Father's SEI 


Education 


SEI 


White 






h 


^2 


h 


Mean 


S.D. 


Father's . 
SEI ^^3^ 


1.00 


.370 


.263 


28.05 


23.57 


Education (X^) 


.255 


1.00 


.570 


11.83 


3.14 


SEI (Xp 


.279 


.560 


1.00 


37.48 


24.53 


Black Mean 


14. 48 


10.43 


23.39 






Black S.D. 


17.90 


3.66 


21.85 
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T^.«= 2, Ordinary Least Square and Corrected Estimates of 
ParaB£i.£iers in a Causal Chain Model of Intergenerational Status 

Attainment 



Predetermined Variables 



Dependent! 

Variablie 
r 



Observed 
Educatinm (X^) 

SEI (XO 



Observed 
Education (X^) 

SEI (X^) 



True 

Education (n) 



Father's SEI 
(X3) 



Observed 
Education 

(X2) 



True 
Education 
(n) 



.0493 
(.370) 



.0521 
(.255) 



.0493 
(.413) 



Ordinary Least Square Estimates 



Irfhltes 



• * • 

4.451 
(.570) 



Blacks 



3.346 
(.560) 



Corrected Estimates 



Whites 



.137 
.325 



.065 
.314 



.171 



Observed 
Education (X^) 

SEI (X^) 



True 

Education (n) 



.0521 
(.356) 



1.000 
(.895) 

5.549 
(.637) 



Blacks 



,801 
.406 



.127 



Observed 
Education (X^) 

SEI (X^) 



• • • 



1.000 
(.715) 
6.546 
(.783) 



.511 
.613 



Note: Standardized (path) coefficients appear in parentheses. 
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the greater the downward bias. For whites, the corrected regression 

coefficient leads one to conclude that one year of true educational 

attainment was converted into 5.5 points of SEX. When measurement errors 

were ignored for whites, SET returns to education were underestimated 

by about 20 percent, assuming the measurement errors were purely random, 
r 

Fot: blacks, the corrected regression coefficient leads one to conclude 
that one year of true educational attainment was converted into 6.5 
points of SEX. When ^measurement errors were ignored for blacks, SEX 
returns to education were underestimated by about 49 percent, again 
assuming the measurement errors were well behaved. 

This analysis was based on the assumption that the only kinds 
of measurement error for both blacks and whites were random, and not 
systematic. Our conclusions are, therefore, contingent on the correctness 
of those assumptions, but suggest that previous studies (for example, 
Duncan, 1969a) comparing the processes of status attainment for whites 
and blacks have exaggerated racial differences in returns to education 
by failing to account for measurement error. Bielby, Hauser and 
Featherman (1977) estimated status attainment models for whites and blacks 
incorporating both structural and response components. They found that 
response errors for whites were random, but were not for blacks. Nonethe- 
less, the substantive consequences were the same as here: ignoring 
measurement errors exaggerated racial differences. 
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STRUCTURAL EQUATION MODELS WITH MEASUREMENT ERRORS 

Unmeasured variables may also be included in causal models in the 
more general case in which unmeasured variables appear as underlying 
causes of several observed variables. This tyoe of model translates 
irvto a confirmatory factor-analysis model with an assumed structural 
order among the factors. A general method has only recently been 
developed which incorporates recursive path analysis, interdependent 
econometric models, factor analysis, and analysis of covariance 
structures. This method, the analysis of linear structural relation- 
ships, or LISREL, was introduced by Jbreskog's (1973) technical paper, 
and has been updated in JSreskog (1977). Less technical introductions 
are available in Long (1976) and JSreskog and SSrbom (1978). A computer 
program is available (Jbreskog and S'drbom, 1978). 

The LISREL model assumes a causal structure among a set of 
unmeasured, latent variables, some designated as exogenous and others 
as endogenous. These unmeasured variables are also related to a set of 
observed variables such that (in the example to follow) the latent 
variables appear as causes of the observed variables. The LISREL model, 
therefore, consists of two parts: the measurement model, and the 
structural equation model (J'dreskog and SSrbom, 1978: 3-4). 

By way of example, Lichtman and Wolfle (1978) are studying the 
processes of educational attainment among several ethnic groups, 
including whites, blacks, and Hispanics. They proposed to compare 
structural equation models among ethnic groups in order to determine 
the extent to which differences exist in the educational returns to 
socioeconomic background and within-school variables. The population 
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under study is the high school graduating class of 1972, 
described in detail by Levinsohn, et al. (1978). One should expect 
that members of this high school cohort have not all completed their 
education as of 1976 (the latest follovTup), so the results pertain to 
educational returns as attained four years after graduation. The 
respondents were initially surveyed in 1972, and followed up in 1973, 
197A, and 1976. Because some questions were repeated in various 
questionnaires, or because alternative means exist for constructing 
manifest variables, in many cases multiple indicators exist for 
latent variables. This becomes important in light of Bohrnstedt and 
Carter's (1971: 142) admonition that, "sociologists seem to be blatantly 
unconcerned with the problems of measurement error." Moreover, Bielby, 
Hauser, and Featherman (1977) showed that differential measurement 
errors existed between blacks and whites in the 1973 replication of 
Blau and Duncan (1967), thus leading to exaggerated racial differences 
in models ignoring measurement error. 

Fcr the expository purposes of this paper, a preliminary model 
incorporating structural associa*-ions among several latent variables, 
and components of measurement errors, has been constructed. This 
model, shown in Figure 2, includes two manifest measures for father's 
educational attainment, mother's educational attainment, and the 
respondent's high school curriculum. Three manifest measures are 
included for respondent's educational attainment. Single manifest 
variables measure father's socioeconomic index, the number of siblings, 
and high school class ranking. The LISREL model specifies that each 
manifest variable is generated by a latent factor for that variable, 
plus a response error which is independent of the latent factor. In 
IJSRKL tormlnolof'.y: 




L Program Program 

, Y. 

1 2 



. 0 Father's [ 
h *SEI Xj * — 



L — ►FaEduc 
/ z 



6^ — »faEduc X^ 



fi.-^MoEduc X, 

4 4 



— ►MoEduc X^ 



Number of 



. Q ^Humbe 

6 Siblings X 




^2 Percentile 
Rank Y„ 



0 



1^ 



^Figure 2. Structural Equation and Measurement Models of Educational Attainment, 1972 High School 
£Y(iC Graduates 



two random vectors n' = (n^^, r]^^ n^)* and 5' = (C^, E.^^ 
represent the latent endogenous and latent exogenous variables, 
respectively. The model specifies a fully recursive causal structure 
among the latent variables, such that: 
3 n = r + C 

whfjpre 6(3 x 3) and r(3 x 4) are matrices of structural coefficients 

in which F is a full matrix relating the exogenous vector to each of 

the endogenous latent variables, and 6 is a matrix relating the 

endogenous variables to each other. C' = (C^, t^^i ^3) a random 

vector of residuals uncorrelated with 

The vectors n and C are not observed, but y' = (yj>...,yg) and 

x' = (x,,...,x^) are observed, such that: 
« 1 o 



y = A n + e 
i .y . 



and 



X = A ^ + 6 
^x ^ 

where e and 6 are vectors of errors of measurement in y and x, 

respectively. These errors of measurement represent both specific and 

random components of variation (see Alwin and Jackson, forthcoming). 

They are assumed to be uncorrelated with n, and C, but may be 

correlated among themselves. The matrices A (6x3), and A (6 x 4) 

**y «x 

are regression matrices of y on n and of x on C> respectively. 

Let $(4 X 4) be the covariance matrix of Let ^(3 x 3) = 
diag(ij;jj, ^22* ^33^ variance matrix of C Let 0^ and 0^ 

be the covariance matrices of e and 6, respectively. In application, 
some of the elements of the four regression matrices, and the four 
covariance matrices, are fixed and equal to assigned values. Other 
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elements are free parameters to be estimated by the method of maximum 
likelihood. This defines the LISREL model. 

The structural model is presented in the path diagram of Figure 2, 
The variables enclosed in ellipses are unobserved, latent variables. 
The manifest variables included in the model are as follows, in which 
th|^ number in parentheses refers to tha variable number as given in 
Levinsohn, et al. (1978): 



= father ' s socioeconomic index (V2468) , 
= composite measure of father's education (V1627), 
X^ = father ^s iiducation (V1009), 

X^ = composite measure of mother's education (V1628), 
X^ = mother's education (VIOIO), 

X = sum of older and younger brothers and sisters 
6 

(V1460 + V1461 4- V1462 + V1463), 
= high school program as reported by respondent ( = 1 if 
academic, = 0 otherwise) (V209) , 
^2 " high school program as reported by school record ( = 1 if 
academic, = 0 otherwise) (V196), 
= percentile rank in class (V631), 
Y, = educational plans as of 10/1/76 (V1855), 
Y^ = educational attainment as of 10/1/76 (V1854), and 
Y^ = educational recode (Melone, personal correspondence). 
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The substantive portion of Figure 2 is a fully recursive model 
among the latent variables, represented by the following structural 
equations: 

^ = ^11^1 ^12^2 ^ISS ^14^4 S 
n2 = Y2if;i + y^l'-l ^23^-3 ^24^4 ^21^ h 
y n3 = Y3^Cj + Y32C2 + + Y3^?^ + 33^0^ + 33202 + C3. 

In algebraic form, the measurement portion of Figure 2 is: 

^2 = ^22^2 S 
X3 = X32?2 + -53 

^ = ^43^3 \ 

^2 = ^21^ ^2 
^3 = ^^2 

^4 = ^3^ '4 
^5 = S3'^3 ^5 
^6 = '63'^3 ^6 
A metric for the latent variables is established by fixing 

^22 ^ ^43 ^ ^21 ^ ^53 ^ '^^^^ "^^^ metric of the latent 

variables frather 's education, mother's education, curriculum, and 
respondent's education are fixed to be the same as that of the 
composite measures of education for father and mother, respectively, 
the school report of curriculum, and educational attainment as of 
10/1/76. The metrics of father's socioeconomic status, siblings, and 
class rank have already been fixed by the algebra of the measurement 
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model. Normalizations of this kind are necessary because the metric of 
an unobserved variable is arbitrary. Consequently, the regression 
slopes of manifest variables on latent variables are identifiable only 
relative to each other. 

The model was estimated with data for white male 1972 high school 
graduates (N = 2955) with the specification that the response errors 
were uncorrelated. The resulting = 150, 75, with df = 38, indicated 
that the model did not do a very good job of reproducing the observed 
variance-covariance matrix. Examination of the first-order derivatives 
indicated the possibility that the specification of uncorrelated response 
errors may have been untenable. Specifically, the response errors of x^ 
and x^ may be correlated. These variables are the composite measures of 
father's and mother's education, and apparently systematic errors of 
construction exist in both variables. Re-estimating the model allowing 
for correlated response errors between x^ and x, resulted in a = 95.00 
with df = 37. Because the difference in these chi-squares is itself 
distributed according to chi-square with one degree of freedom, it is 
obvious that the correlated response error was statistically significant. 
Yet once again the model does not do a very good job of reproducing the 
variance-covariance matrix. Re-examination of the first-order derivatives 

suggested that y^ and y^ had correlated response errors. Re-estimation 
2 

yielded a x = 73.85, with df = 36, which became the final model because 

the addition of the next most likely correlated response error did not 

2 

significantly reduce the value of x (see Sbrbom, 1975). 

Identical models were also estimated for black males (N = 257) with 

2 9 
X = 43.45, with df = 36, and for Hispanic males (N = 125) with x = 51.59, 
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with df = 36. Estimates for the measurement model are shown in Table 3 
for white males. Table 4 for black males, and Table 5 for Hispanic males. 
Shown in column 3 of these tables are the standard deviations of manifest 
variables; column 4 contains the standard deviations of response errors 
not accounted for by the underlying latent variables; column 5 shows the 
standard deviations of the latent variables; column 6 contains the 
relative slopes of the manifest variables regressed on the latent 
variables; and column 7 shows estimates of the reliability coefficients. 

Among white males in the NLS sample, different reports of the same 
underlying variable were likely to have different slope coefficients. 
For some variables, such as curriculum track, these different sloptes 
indicate different fits between the manifest and latent variables. For 
example, among whites the school record measure of curriculum tracK was 
a more reliable indicator of the true variable than was the student's 
own report. For other variables, different slopes reflect different 
scales of the manifest variables. For example, the composite measures of 
parental education were scaled from 1 (less than high school) to 5 (MA, or 
PhD), while the first followup questions of parental education were scaled 
from 1 (none, or grade school only) to 9 (PhD or equivalent). However, 
the reliability coefficients for these variables indicate that the 
composite measures of parental education, which were based on responses 
to baseyear, first followup, and activity state questionnaires, were^siore 
reliable measures of the underlying latent variables than were the first 
followup questions alone. However, caution should be exercised in 
generalizing from these preliminary results. The fact that the two 
measures of parental education differ in their scales of measuremeirt may 
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Table 3. Measurement Model Parameter Estimates for I^mice Male 1972 High School Graduates (N = 295^ 



Variable 



True (x.) Observed 



Obsarved 
SD 



SD of 
Error 



SD 
Tri 
Scr 



Relative 
Slope 

A . . 



Reliability 
Coefficient 

(a2 /a2)A.. 

T . 1 1" 

J : 



FAEDUC 
5o 



X2 (V1627) 

x^cvioog) 



1.28 
2.22 



.44 (.11) 
.49 (.19) 



1.20 



1.00 

1.81 (.02) 



.88 
.53 



MOEDUC 



X, (V1628) 
4 

X^(VIOIO) 



1.01 
1.73 



.46 (.11) 
.34 (.20) 



.90 



1.00 

1.87 (.03) 



.79 
.51 



cuRRia'i^-;-.^ 



Y^(V209) 
Y2(V196) 



.50 
.50 



,33 (.06) 
,28 (.06) 



,41 



.91 (.02) 
1.00 



.61 
.67 



EDUCATIDN 



Y^(V1855) 
(V1854) 
Yg (recode) 



2.30 
1.97 
.70 



1.20 (.25) 
.82 (.20) 
.29 (.07) 



1.79 



1.10 (.02) 
1.00 
.36 (.01) 



Note. — Standard errors of parameter estimates appear in parentheses 
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.67 

.83 

* * * 
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Table 4. Measurement Model Parameter Estimates for Black Male 1972 High School Gr:ad\iates (N = 25 



Variable 



True (t^) Observed 



Observed 
SD 



a. 

1 



SD of 
Error 



e . 

1 



SD of 
True 
Score 



Relative 
Slope 

A. . 



Reliabilil 
Coef f iciei 

(o2 /a2)A 

T. X 3 



FAEDUC 



X2(V1627) 
X^(V\009) 



.94 

1.67 



.48 (.19) 
.13 (.35) 



,82 



1.00 

2.04 (.12) 



.76 
.49 



MOEDUC 

5o 



X^(V1628) 
X5(V1010) 



.97 

1.63 



.67 (.23) 
,37 (.45) 



.71 



1.00 

2.34 (.22) 



.54 

.44 



CURRICULUM 
"^1 



Y^(V209) 
Y2(V196) 



.47 

.46 



.36 (.12) 
.26 (.13) 



.38 



.79 (.10) 
1.00 



.52 
.68 



EDUCATION 



Y^(V1855) 
¥^(71854) 
Y. (recede) 



2.16 
1.88 
.64 



1.16 (.51) 
1.01 (.41) 

.19 (.15) 



1.58 



1.15 (.09) 
1.00 
.38 (.03) 



.62 

.71 

* * * 



Note. — Standard errors of parameter estimates appear in parentheses. 
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Tabl^ 5. Measurement Model Parameter Estimates for Hispanic Male 1972 High School Graduates (N = 



Variable 



Observed 
SD 



SD of 
Error 



SD of 
True 
Score 



True (Tj) 



Observed 



Relative 
Slope 

^13 



Reliabilit 
Coef f icien 

(o2 /o2)A, 



FAEDUC 
^2 


X2(V1627) 
X2(V1009) 


1.08 
2.05 


.53 
.87 


(.25) 
(.47) 


.93 


1.00 

1.99 (.17) 


.74 
.41 


MOEDUC 


X, (V1628) 
(VIOIO) 


.82 

1.51 


.34 
.33 


(.13) 
(.32) 


.75 


1.00 

1.98 (.13) 


.84 
.49 


CURRICULUM 


Y^(V209) 
Y2(V196) 


.41 
.46 


.33 
.33 


(.13) 
(.15) 


.32 


.76 (.15) 
1.00 


.46 
.48 


EDUCATION 


Y, (V1855) 


2. 19 


1.23 


(.65) 




1.33 (.18) 


.51 


113 


Y5(V1854) 


1.77 


1.12 


(.49) 


1.36 


1.00 


.59 




Y, (recede) 


.54 


.32 


(.16) 




.32 (.04) 


* * * 



Note. — Standard errors of parameter estimates appear in parentheses. 



ERIC 



indicate that the two observed variables are not manifestations of the 
same true score. 

Caution is particularly recommended in interpreting the results of 
the measurement of respondent's education. The preliminary model estimated 
here included two measures of education measured on the same scale: 
educational plans as of 10/1/76 and educational attainment as of 10/1/76. 
A third composite measure was included, one suggested by the staff of 
NCES in which 1 = no higher education, 2 = some college, and 3 = BA and 
higher. The relative slope of this composite measure varies so much from 
the other two manifest variable regression slopes, and its error variance 
is so different, that it should probably not be viewed as a manifest 
component of the same latent factor that underlies the other two education 
variables. One result of the mismatch is a meaningless reliability 
estimate greater than unity. 

One further caution of interpretation is worth noting. Classical 
true score models express an observed score in terms of two orthogonal 
components composed of a true score and an error score. As a result, 
errors based on true score models are uncorrelated with true scores and 
other error scores. However, the vectors of residual factors, 6 and £, 
contain both measurement error and reliable variation specific to each 
manifest variable (Alwin and Jackson, forthcoming). As a result, it is 
possible for some or all of the residual errors to be correlated even in 
the population, much as we have seen that the errors of and x^, the two 
composite measures of parental education, were correlated. Apparently, 
whatever errors of measurement entered into the construction of one 
parent's education composite score also entered into the other parent's. 



Comparison of the measurement-model results for white males to those 
of blacks and Hispanics indicates that within each population the most 
reliable measure of parental education was the constructed composite 
variable. The most reliable measure of curriculum membership was the 
school record. The most reliable measure of true education was the 
respondent's report of his educational attainment as of 10/1/76. Across 
populations, the reliability coefficients for blacks and Hispanics were 
lower than those for whites. Both blacks and Hispanics exhibited less 
variation in the observed measures than did whites. Blacks and Hispanics 
also exhibited less variation in the latent factor scores; proportionately, 
there were even greater disparities among the latent variances than among 
the observed. As a result, the reliability coefficients for blacks and 
Hispanics were substantially lower than those of whites. 

Clearly these findings suggest caution in interpreting models of 
status attainment among minority groups that do not take account of 
response error, especially when comparing structural coefficients across 
groups s Table 6 presents ordinary least square (in parentheses) and 
corrected LISREL estimates for the structural equation portion of the 
model represented by Figure 2. Comparison of these estimates provides 
some indication of the biases encountered when measurement errors are 
ignored. (Another example has been offered by Bielby, Hauser, and 
Featherman, 1977.) 

First, the ordinary least squares regression of educational 
attainment on four family background variables and two intervening 
measures of high school effects accounts for two-fifths of the variance 
in educational attainment for white males, but only one-fourth of the 



Table 6. Corrected (LISREL) and OLS Estimates of Parameters of the Educational Attainment Process 

for 1972 High School Graduates 



Predetermined Variables* 



VAriahlp 


laoiiL 


rdijGUC 


notduc 


NumSibs 


Curriculum 


H.S. Rank 


„2 
R 


l^ites 
















Curriculum 


.002 


.084 


.052 


-.025 


• < 1 


1 1 • 


.165 




(.002) 


(.068) 


(.049) 


(-.026) 


• « • 


1 « • 


(.109) 


K.S. Rank 


-.032 


.587 


-.362 


-.160 


37.787 


1 • • 


.325 




i.m) 


(1.303) 


(-.038) 


(-.482) 


(25.041) 


It! 


(.232) 


- Education 


.002 


.194 


.095 


-.063 


2.271 


.012 


.558 




(.003) 


(.237) 


(.092) 


(-.071) 


(1.326) 


(.020) 


(.400) 


Blacks 
















Curriculum 


.003 


.063 


-.008 


-.031 


III 


III 


.086 




(.002) 


(.056) 


(.004) 


(-.018) 


• 1 • 


1 1 1 


(.050) 


H.S. Rank 


-.074 


-2.3U 


2.952 


.723 


34.045 


1 t • 


.237 




(-.032) 


(-2.306) 


(3.115) 


(.552) 


(24.198) 


• t • 


(.193) 


Education 


.012 


.010 


M 


-.021 


1.825 


.009 


.411 




(.015) 


(.013) 


i.m 


(-.043) 


(1.068) 


(.016) 


(.249) 



Table 6. (continued) 



Dependent 
Variable 






Predetermined Variables* 




■ r2 


FaSEI 


FaEduc 


MoEduc 


NumSibs 


Curriculum H«S« Rank 


• 

Hispanics 














Curriculum 

VUL X SAX UlU 


.004 


- nnn 


-.021 


.002 


• « 1 III 


.062 




(.003) 


(-.018) 


(.041) 


(.006) 


< • « III 


(.025) 


H«S. Rank 


•014 


-.212 


2.996 


1.301 


50.983 


.407 




(.147) 


(.341) 


(.172) 


(U227) 


(26.166) 


(.255) 


Education 


-.012 


-.053 


.168 


-.072 


2.770 .010 


.582 




(-.006) 


(-.096) 


(.018) 


(-.110) 


(.558) (.028) 


(.249) 



Note. - The ordinary least squares estimates appear in parentheses below the LISREL 
estimates. The variables used in the OLS regressions were Father's SEI (X^: V2468), Father's 
Education Composite (X^: V1627), Mother's Education Composite (X^: V1628), Number of Siblings 
(X^: V1460 - V1463), Curriculum (= 1 if Academic; = 0 otherwise} (Y^: V196), High School 
Percentile Rank (Y : V631), and Educational Attainment as of 10/1/76 (Y.: V1854). 



variance for both blacks and Hispanics. However, these results confound 
measurement error with true variation and result in coefficients of 
determination that understate the ratio of explained to total true 
variance by nearly 30 percent for whites, but by nearly 40 percent for 
blacks and 60 percent for Hispanics. By taking measurement errors into 

adjpount, over half the variance in true educational attainment can be 

2 2 
explained for Hispanics (R'^ = .582) and for whites (R = .558), but not 

for blacks (R^ = .411). 

Comparison of the OLS to corrected estimates in the regression of 
educational attainment again indicates the biases due to ignoring measure- 
ment errorse In particular, notice that ignoring measurement errors does 
not necessarily produce attenuated estimates . Indeed, some of the OLS 
estimates are substantially larger than the corrected estimates. For all 
three groups, corrections for measurement error produce nearly identical 
effects, at least in the direction of the bias. There appear to be 
downward biases (the OLS estimates underestimate the corrected estimates) 
for mother's education and curriculum track. There appear to be upward 
biases for father's education, the number of siblings, and high school 
class ranking. There is also an upward bias for father's socioeconomic 
index anvong whites and b.lficks, but for Hispanics the OLS estimate for 
father's SEX understates the corrected estimate. 

Although thit direction of bias is nearly uniform across ethnic 
groups, the magnitude is not. One example is provided by the effect of 
membership in an academic track on educational attainment. Ordinary 
least squares regressions would indicate that membership in an academic 
track results in about one and one-third additional years of education 
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for whites (measured four years after high school graduation), and one 
additional year for blacks, but only one-half year for Hispanics. However, 
when the confounding effects of measurement errors are removed, Hispanics 
are estimated to convert membership in a high school academic track into 
2.8 additional years of education, while whites have a comparable estimate 
ofy2.3 years, and blacks 1.8 years. 

Another example is provided by the effect of high school rank on 
educational attainment. Ranking the magnitude of the OLS estimates would 
lead one to conclude that Hispanics were best able to convert increased 
high school class ranking into educational attainment, followed by whites, 
then blacks. However, when corrected for tneasurement errors^ all three 
groups were apparently equally able to convert class ranking into educational 
attainment. 

Overall, the consequences of ignoring measurement error appear to be 
greater in the case of Hispanics than of either whites or blacks, and 
greater for blacks than for whites. Since the biases in structural 
estimates ignoring measurement error are larger among Hispanics and 
blacks than among whites, uncorrected ethnic comparisons show unrealistically 
large differences between ethnic groups in the effects of familial background 
ana high school process effects. 

CONCLUSIONS 

Educational researchers have long known that ignoring measurement 
errors will lead to biased estimates of structural effects. However, until 
recently multivariate analytic procedures which correct for measurement 
errors were not generally available. Recent developments by JBreskog 
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and SBrbom (1978) have made available a general computer program that 
permits estimation of structural effects corrected for measurement 
errors. The application of these techniques to a substantive problem 
in education has indicated the advantages of the LISREL approach, along 
with several cautionary reservations • 

\ The most important substantive conclusion inherent in this 
analysis supports the findings of Bielby, Hauser and Featherman (1977): 
measurement errors differ between blacks and whites; ignoring them 
leads to biased estimates of structural effects. Moreover, the present 
analysis shows that Hispanics also report data with inherent measurement 
errors, and ignoring them will lead to estimates even more biased than 
among either whites or blacks. 

Another set of substantive conclusions could be drawn from the 
estimated parameters of the Hispanic model of educational attainment. 
To the best of my knowledge, these are the first estimates, unbiased by 
measurement error, of the process of status attainment for any ethnic 
group in America other than whites or blacks. However, I have refrained 
from discussing Hispanics because the model explicated in this paper was 
a preliminary construction, and is already outmoded. In particular, the 
model omits measures of ability, and as Scarr and Weinberg (1978) 
demonstrated, the omission of ability leads to spurious estimates of 
causal effects. 

Finally, a cautionary note is in order. Kerlinger (1977) correctly 
pointed out that the LISREL approach toward multivariate analysis 
contains a great deal of promise for testing theories that have been 
difficult to test adequately with previously available analytic procedures. 



30 



However, application of LISREL (indeed, any analytic procedure) depends 
upon the collection of appropriate data. Specifically, measures of 
different variables must be ascertained on diff erent occasions , or by 
different means, data collection procedures that can be considerably 
more expensive than the usual survey. 

My own view of the utility of LISREL is more skeptical than 
Kerlinger's (1977). Unless the data to be analyzed have been collected 
by appropriate procedures, and unless the model is adequately specified, 
LISREL is unlikely to produce the definitive tests Kerlinger suggests ar 
possible. The past decade has seen recursive path analytic procedures 
faddishly applied to implausibly constructed models. Except for the 
inherent difficulties in specifying the model of the computer program, 
the next decade may see implausible examples of substantive analyses 
based on LISREL. In the past six months I have twice had manuscripts 
returned to me with reviewer's naive suggestions that the problems they 
recognized could be solved by reanalyzing the data with LISREL. They 
could not. As Cooley (1978: 13) so insightfully pointed out last year, 
more important than number crunching is the careful measurement of a 
few "right" variables, variables that permit statistical controls for 
major alternative explanations. Data analysis may stimulate thinking, 
but it is not a substitute for it. 
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APPENDIX 



The three equations which define the structural model are: 

= n + e (1), 
2 ■ zn 

n = b + u (2), 

X = b, n + V (3). 
{ 1 In 

The notation may be revised such that x^, x^, x^, and n refer 

to the standardized values of these variables. Equations (1) through 

(3) may be rewritten using the using the usual ecualities: 

p = b (a /a ) (4), 
"^yx yx X y 

p = a /a ^ (5) . 

yxi u J' 

These coefficients were termed path coefficients by Sewell Wright 
(1921). Rewriting equations (1) through (3) in terms of path 
coefficients and standardized variables yields: 

^ = Pn3^3 ^ Pnu" 

with the specifications 

ECx^u) = E(nv) = E(X3V) = 0 (9), 
E(ne) = E(Xje) = ECx^e) = 0 (10). 
To solve the path coefficients in equations (6) through (8), we will 
multiply through these equations by one or another of the variables, and be 
taking expectations. Because the covariance of two standardized variables 
is the coefficient of correlation, taking expectations of a covariance 
will yield the population correlation coefficient, p. 
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First, multiply equation (6) through by n, and we have: 

since E(nn) = 1, and E(ne) = 0 by assumption. In similar fashion, 
multiplying equation (7) through by x^, and equation (8) through by 



n, yields: 
r 

\3 ^ ^n3 (12), 



P, = P, (13). 

Multiplying equation (6) through by x^, and taking expectations, 
yields: 

Poo = Po-P„o (1^)» 

because E(x2e) = 0, by assumption, and p^^ ^ equation (12). 

Multiplying equation (8) through by x^, and taking expectations, yields: 

Pio = Pi P o (15), 

13 '^in n3 

and multiplying equation (8) through by x^ yields: 
Pl2 = PlnP2n 

because E(x2v) = 0, an equality implied by equations (9) and (10). 

• Equations (14), (15), and (16) form three equations in three 
unknowns, and: 



Pin = '^^12^13/^23 ' 



P2n = 7^12^23/^13 

Pn3 = /''13P23/P12 
Using the sample correlation coefficients given in Table 1 to 
estimate the population coefficients in equations (17) through (19) 
give the standardized results presented in Table 2, 
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The corrected regression coefficient b^^ equation (2), as 

implied by equation (4), is given by: 

b = p .(a /a J (20), 
n3 n3 n 3 

in which 

a = p. a. (21), 
n ^^20 2 

f 

thk usual association between the true and observed standard deviations 
(for example, Gulliksen, 1950: 23). Note therefore that: 

^3 = Pn3(P2nV^3> ^''^^ 
'^^^ Pn3P2n = ^23 = ^23^ 

n3 *^23^ 2 3 

That is, the corrected regression coefficient of true education 

regressed on father's SEI is equal to the OLS regression coefficient 

one would obtain from regressing observed education on father's SEI. 

The corrected regression coefficient b^ in equation (1) is 

2ri 

given by: 

= P9n(°9/°n) ^^^^ ' 

but substitution by equation (21) reveals that 

b^ =1.00 (25). 
2n 

Finally, let's consider the corrected regression coefficient b^^ 

In equation (3). From equation (25), equation (1) may be rewritten 

X2 = n + e, and by substitution, equation (3) becomes: 

X, = b. (x. - e) + V (26), 
1 In 2 

X = b, x^ - b. e + V (27). 
1 In 2 In 

Multiplying equation (27) through by x^ and taking expectations 
yields: 

2 

^10 b^ - b, a + a (28), 
12 In 2 In 2e 2v 

2 

but a« « 0 as above, and a. = a , which may be verified by multiplying 
2v 2c c 



34 



equation (1) by e and taking expectations. Thus, 




(29); 



(30). 



and 




".2 <''2"°2 - °e» 



(31), 



Thj^t is, the corrected regression coefficient of respondent's SEI 
on true education is equal to the OLS estimate only when the error 
of measurement variance is zero. Otherwise, the greater the variance 
of errors, the greater will be the downward bias in b^^' this 
example, because"the. variance of errors was greater for blacks than 
for whites, the ULS estimates were more biased for blacks. 
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